首页> 外文OA文献 >AdaScan: Adaptive Scan Pooling in Deep Convolutional Neural Networks for Human Action Recognition in Videos
【2h】

AdaScan: Adaptive Scan Pooling in Deep Convolutional Neural Networks for Human Action Recognition in Videos

机译:adascan:深度卷积神经网络中的自适应扫描池   视频中的人类行为识别

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

We propose a novel method for temporally pooling frames in a video for thetask of human action recognition. The method is motivated by the observationthat there are only a small number of frames which, together, containsufficient information to discriminate an action class present in a video, fromthe rest. The proposed method learns to pool such discriminative andinformative frames, while discarding a majority of the non-informative framesin a single temporal scan of the video. Our algorithm does so by continuouslypredicting the discriminative importance of each video frame and subsequentlypooling them in a deep learning framework. We show the effectiveness of ourproposed pooling method on standard benchmarks where it consistently improveson baseline pooling methods, with both RGB and optical flow based Convolutionalnetworks. Further, in combination with complementary video representations, weshow results that are competitive with respect to the state-of-the-art resultson two challenging and publicly available benchmark datasets.
机译:我们提出了一种新的方法,用于在视频中临时合并帧中的人类动作识别任务。该方法是由于观察到只有少量帧,这些帧一起包含足够的信息来将视频中存在的动作类别与其余的区别开来。所提出的方法学会合并这些区分性和信息性帧,同时在视频的单个时间扫描中丢弃大多数非信息性帧。我们的算法通过不断预测每个视频帧的区别重要性,然后将它们合并到深度学习框架中来实现。我们在标准基准上显示了我们提出的合并方法的有效性,该方法通过基于RGB和基于光流的卷积网络不断改进了基准合并方法。此外,结合互补的视频表示,我们在两个具有挑战性且可公开获得的基准数据集上显示了与最新结果相比具有竞争力的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号